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MICRQARRAYS AND TTSRfi THEREFOR 

Fiel<a of the Invention 

The invention disclosed herein relates to new 
5 methods of using microarray technologies. The methods 
are useful for identifying and characterizing specific 
antibodies as well as the characterization of different 
tissues or cells by protein or nucleic acid analysis. 

10 Background of the Invention 

Recent breakthroughs in nucleic acid sequencing 
technology have made possible the sequencing of entire 
genomes from a variety of organisms, including humans. 

15 The potential benefits of a complete genome sequence are 
many, ranging from applications in medicine to a greater 
understanding of evolutionary processes. These benefits 
cannot be fully realized, however, without an 
understanding of how and where these newly sequenced 

20 genes function. 

Traditionally, functional understanding started with 
recognizing an activity, isolating a protein associated 
with that activity, then isolating the gene, or genes, 
encoding that protein. The isolated protein was also 
25 used to generate antibody reagents. Specific antibodies 
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and fragments of the isolated gene were both employed to 
study tissue expression and function. 

Several methods have been used to study protein 
expression patterns including in situ hybridization 
5 studies of tissue sections and Northern blots. These 
methods are both time consuming and require relatively 
large amounts of material to perform successfully. 

Antibodies that bind to specific antigens have been 
produced by a variety of methods including immunization 

10 of animals, fusion of mammalian spleen cells to 
immortalized cells to produce hybridomas, random peptide 
generation using phage or bacterial display and 
constrained peptide libraries. Regardless of how the 
desired antibody is generated, the methods currently 

15 available to identify one with a particular binding 
specificity are generally laborious and incapable of the 
simultaneous testing of large numbers of unknowns. 

One method involves binding the antigen to a porous 
membrane, such as nitrocellulose, contacting the membrane 

20 with a source of test antibodies, then determining 
whether or not any of the test antibodies has bound to 
the antigen. This method only allows the testing of one 
source of test antibodies per piece of porous membrane, 
making the method both inconvenient and wasteful of 

25 materials. 
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Antibody /antigen reactions can also be evaluated in 
plastic plates, such as 96-well microtiter plates, using 
methods similar to those described above. This method is 
5 likewise limited in the number of samples that can be 
tested in any one assay, thus requiring many assays to 
fully evaluate a large number of antibody unknowns. 
Chang (US Patent No. 4,591,570, issued 5/27/86) describes 
an array of a limited number of characterized antibodies 
10 to known antigens on a glass surface that can be used to 
bind to specific antigens on the surface of whole cells. 

i '(I 

4^ Recently new technologies have arisen that allow the 

OB creation of microarrays containing thousands or millions 

of different elements. Such array technology has been 
15 applied mainly to forming arrays of individual nucleic 
ng acids (see, for example, Marshall and Hodgson, Nature 

j| Biotech. 16:27-31, 1998; Ramsay, Nature Biotech. 16:40- 

44, 1998), in particular short oligonucleotides 

synthesized in si tu . 

20 Methods are needed to simply and rapidly screen very 

large numbers of uncharacterized antibodies for those 
specific for a given antigen as well as for the 
characterization of tissues and cells by nucleic acid 
and/or protein analysis. The invention described herein 

25 addresses that need. 
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Brief Description of the Invention 

The invention disclosed herein comprises methods of 
using microarrays to simplify analysis and 
characterization of genes and their function. In one 
5 aspect of the invention the methods are used to identify 
and characterize antibodies having binding affinity for a 
specific target antigen. This method comprises 

contacting an array of uncharacterized antibodies bound 
to a solid surface with at least one target antigen and 
10 identifying the antibodies to which the target antigen 
binds. The method can be performed under a variety of 
conditions to identify antibodies with a range of binding 
affinities . 

A second aspect of the invention comprises a method 
15 of determining gene expression at the protein level 
comprising contacting an array of characterized or 
uncharacterized antibodies on a solid surface with one or 
more proteins and identifying the antibodies to which 
said protein (s) binds. This method can be further used 
20 to compare the protein expression in two different 
populations of cells, such as normal cells and cancer 
cells or resting cells and stimulated cells. A related 
embodiment can be used as a tool in the diagnosis of 
various disorders . 

25 A further aspect of the invention comprises a method 

of determining gene expression at the protein level 
comprising contacting a microarray of nucleic acid 
samples derived from a variety of different sources with 
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one or more nucleic acid probes then identifying the 
sample or samples to which the probe binds. 



Brief Description of the Figures 

5 Figures lA, IB, and IC show microarrays of 

antibodies bound to positively charged nylon, reacted 
with antigen and detected by non- fluorescent means. 

Figure 2 shows a microarray produced using a robotic 
arraying apparatus. Antigen binding is detected by non- 
10 fluorescent means. 

Figure 3 shows the ability of the antibody 
microarrays to evaluate relative binding affinities to a 
specific antigen. 

Figure 4 shows a microarray of polyclonal antibodies 
15 in comparison to a microarray of monoclonal antibodies. 

Figures 5A, 5B and 5C show a microarray of 
antibodies reacted with a cell lysate under conditions 
that vary the amount of background binding. 

P etailed Pescy iptipn. o£ thg ixxv^ntion 

20 The present invention discloses methods of using 

microarrays to simplify analysis and characterization of 
genes and their function. In a first aspect of the 
invention the methods are used for identifying and 
characterizing antibodies having binding specificity to a 
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particular antigen or set of antigens. This method 
utilizes microarray technology to create ordered matrices 
of large numbers of uncharacterized antibodies which can 
then be contacted with antigen under a variety of 
5 conditions. The method is rapid and simple to perform 
and is applicable to the simultaneous screening of very 
large numbers of antibodies. 

Briefly, uncharacterized antibodies are bound to a 
solid surface in an array format consisting of discrete 
10 spots whose spatial location can be easily identified. 
Each location represents an antibody from a known source, 
PI such as a particular hybridoma growing in a well in a 96- 

yj^ well microtiter plate. The space between the antibody 

spots is treated to minimize non-specific binding to the 
U1 15 solid support. The arrayed antibodies are then contacted 

p with an antigen, or a set of antigens, for which specific 

antibodies are sought. The antigen solution is left in 
4^ contact with the array for an amount of time sufficient 

yj to allow antigen: antibody complexes to form (generally 10 

20 minutes to 2 hours) , then the unbound antigen is washed 
away under suitable conditions. Bound antigen is 

detected at a particular antibody spot using one of a 
variety of detection methods, thus identifying the source 
of an antibody specific for the particular antigen. 

25 The term "antibody" is used herein in the broadest 

sense and specifically includes intact monoclonal 
antibodies, polyclonal antibodies, multispecif ic 

antibodies {e.g. bispecific antibodies) formed from at 
least two intact antibodies, and antibody fragments. 
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including single chain antibodies, so long as they 
exhibit the desired binding properties as described 
herein , 

Various procedures well-known in the art may be used 
5 for the production of polyclonal antibodies to an epitope 
or antigen of interest. A host animal of any of a number 
of species, such as rabbits, goats, sheep, horse, cow, 
mice, rats, etc. is immunized by injection with an 
antigenic preparation which may be derived from cells or 
10 microorganisms, or may be recombinantly or synthetically 
produced. Various adjuvants well known in the art may be 
ni used to enhance the production of antibodies by the 

\j% immunized host, for example, Freund^s adjuvant (complete 

fifz 

and incomplete) , mineral gels such as aluminum hydroxide, 
15 surface active substances such as lysolecithm, pluronic 

P polyols, polyanions, peptides, oil emulsions, keyhole 

limpet hemocyanins, dinitrophenol , liposomes, potentially 
useful human adjuvants such as BCG (Bacille Calmette- 

y5 Guerin) and Propionihacterium a.ca.nes , and the like, 

20 The term "monoclonal antibody" as used herein refers 

to an antibody obtained from a population of 
substantially homogeneous antibodies, i.e., the 
individual antibodies comprising the population are 
identical except for possible naturally occurring 

25 mutations that may be present in minor amounts. 
Monoclonal antibodies are highly specific, being directed 
against a single antigenic site. Furthermore, in 

contrast to conventional (polyclonal) antibody 

preparations which typically include different antibodies 
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directed against different determinants (epitopes) , each 
monoclonal antibody is directed against a single 
determinant on the antigen. Preferred antibodies are 
mAbs, which may be of any immunoglobulin class including 
5 IgG, IgM, IgE, IgA, and any subclass or isotype thereof. 

In addition to their specificity, monoclonal 
antibodies are advantageous in that they are synthesized 
by hybridoma culture, uncontaminated by other 
immunoglobulins. The modifier "monoclonal" indicates the 

10 character of the antibody as being obtained from a 
substantially homogeneous population of antibodies, and 
is not to be construed as requiring production of the 
antibody by any particular method. For example, the 
monoclonal antibodies to be used in accordance with the 

15 present invention may be made by the hybridoma method 
first described by Kohler et al . , Nature, 256:495 (1975), 
or may be made by recombinant DNA methods (see, e.gr. , 
U.S. Patent No. 4,816,567, incorporated by reference 
herein) . The "monoclonal antibodies" may also be 

20 isolated from phage antibody libraries using the 
techniques described in Clackson et al . , Nature^ 352:624- 
628 (1991) and Marks et al , , J. Mol , Biol,, 222:581-597 
(1991), for example. 

The monoclonal antibodies contemplated for use 
25 herein specifically include "chimeric" antibodies 
(immunoglobulins) in which a portion of the heavy and/or 
light chain is identical with or homologous to 
corresponding sequences in antibodies derived from a 
particular species or belonging to a particular antibody 
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class or subclass, while the remainder of the chain (s) is 
identical with or homologous to corresponding sequences 
in antibodies derived from another species or belonging 
to another antibody class or subclass, as well as 
5 fragments of such antibodies, so long as they exhibit the 
desired biological activity (U.S. Patent No. 4,816,567; 
Morrison et al . , Proc. Natl. Acad. Sci . USA, 81:6851-6855 
(1984) ) . 

"Humanized" forms of non-human (e.g., murine) 

10 antibodies are chimeric immunoglobulins, immunoglobulin 
chains or fragments thereof (such as Fv, Fab, Fab', 
F(ab*)2 or other antigen-binding subsequences of 
antibodies) which contain minimal sequence derived from 
non-human immunoglobulin. For the most part, humanized 

15 antibodies are human immunoglobulins (recipient antibody) 
in which residues from a complementarity- determining 
region (CDR) of the recipient are replaced by residues 
from a CDR of a non-human species (donor antibody) such 
as mouse, rat or rabbit having the desired specificity, 

20 affinity, and capacity. In some instances, Fv framework 
region (FR) residues of the human immunoglobulin are 
replaced by corresponding non-human residues. 
Furthermore, humanized antibodies may comprise residues 
which are found neither in the recipient antibody nor in 

25 the imported CDR or framework sequences. These 
modifications are made to further refine and maximize 
antibody performance. In general, the humanized antibody 
will comprise substantially all of at least one, and 
typically two, variable domains, in which all or 

30 substantially all of the CDR regions correspond to those 
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of a non-human immunoglobulin and all or substantially 
all of the FR regions are those of a human immunoglobulin 
sequence. The humanized antibody optimally also will 
comprise at least a portion of an immunoglobulin constant 
5 region (Fc) , typically that of a human immunoglobulin. 
For further details, see Jones et al . , Nature, 321:522- 
525 (1986); Reichmann et al . , Nature, 332:323-329 

(1988); and Presta, Curr. Op, Struct. Biol., 2:593-596 
(1992) . The humanized antibody includes a PRIMATIZED™ 
10 antibody wherein the antigen-binding region of the 
antibody is derived from an antibody produced by 
immunizing macaque monkeys with the antigen of interest. 

"Antibody fragments" comprise a portion of an intact 
antibody, preferably the antigen binding or variable 
^■^ 15 region of the intact antibody. Examples of antibody 

P fragments include Fab, Fab*, F(ab')2/ fragments; 

diabodies; linear antibodies (Zapata et al . Protein Eng. 
8 (10) : 1057-1062 (1995)); single-chain antibody molecules; 
4^ and multispecif ic antibodies formed from antibody 

20 f r agment s . 

Particularly preferred in the practice of the 
invention are single-chain antibodies. "Single-chain" or 
"sFv" antibodies are antibody fragments comprising the Vh 
and Vl domains of an antibody, wherein these domains are 
25 present in a single polypeptide chain. Preferably, the 
Fv polypeptide further comprises a polypeptide linker 
between the V„ and domains which enables the sFv to 

form the desired structure for antigen binding. For a 
review of sFvs see Pluckthun in The Pharmacology of 
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Monoclonal Antibodies, vol. 113, Rosenburg and Moore 
eds.. Springer- Verlag, New York, pp. 269-315 (1994). 

Large quantities of single chain antibodies with 
uncharacterized randomized binding specificity can be 
5 produced using a number of methodologies known in the 
art. Recombinant antibody libraries can be created in 
filamentous phage particles (Daniels and Lane, Methods 
9 (3) :494-507, 1996; Reichmann and Weill, Biochemistry 
32 (34) : 8848-8855; Rader and Barbas , Curr Opin Biotechnol 

10 9 (4) : 503-508, 1997; Iba and Kurosawa, Immunol Cell Biol 
75 (2) :217-221, 1997, WO 90/05144, WO 92/01047, 

WO 92/20791, WO 93/19172, GB 9722131.8, GB9810228.8 and 
GB 9810223.9, all of which are incorporated by reference 
herein in their entirety) , for example, or similarly in 

15 yeast, bacteria, and the like. Other methods for 

creating random libraries of sFvs include various solid 
state synthesis methods. 

The term "diabodies" refers to small antibody 
fragments with two antigen-binding sites, which fragments 

20 comprise a heavy- chain variable domain (Vh) connected to a 
light -chain variable domain (Vl) in the same polypeptide 
chain (V^ - Vl) . By using a linker that is too short to 
allow pairing between the two domains on the same chain, 
the domains are forced to pair with the complementary 

25 domains of another chain and create two antigen-binding 
sites. Diabodies are described more fully in, for 
example, EP 404,097; WO 93/11161; and Hollinger et al . , 
Proc. Natl. Acad, Sci. USA, 90:6444-6448 (1993). 
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The antibodies employed in the invention can be 
isolated prior to creating a microarray. An "isolated" 
molecule, whether an antibody, antigen or nucleic acid, 
is one which has been identified and separated and/or 
5 recovered from a component of its natural environment. 
Contaminant components of its natural environment are 
materials which would interfere with particular uses for 
the molecule, and may include enzymes, hormones, and 
other proteinaceous or nonproteinaceous solutes. In 
10 preferred embodiments, a protein will be purified (1) to 
greater than 95% by weight of protein as determined by 
J5 the Lowry method, and most preferably more than 9 9% by 

weight, (2) to a degree sufficient to obtain at least 15 
5| residues of N-terminal or internal amino acid sequence by 

f:^ 15 use of a spinning cup sequenator, or (3) to homogeneity 

s by SDS-PAGE under reducing or nonreducing conditions 

S| using Coomassie blue or, preferably, silver stain. 

^.^ Isolated protein includes the protein in situ within 

recombinant cells since at least one component of the 
20 protein's natural environment will not be present. 
Ordinarily, however, isolated protein will be prepared by 
at least one purification step. Unpurified antibodies, 
such as those found in serum, can also be employed in the 
present invention . 

25 By "isolated" in reference to nucleic acid is meant 

a polymer of 14, 17, 21 or more contiguous nucleotides, 
including DNA or RNA that is isolated from a natural 
source or that is synthesized. The isolated nucleic acid 
of the present invention is unique in the sense that it 

30 is not found in a pure or separated state in nature. Use 
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of the term "isolated" indicates that a naturally 
occurring sequence has been removed from its normal 
cellular ( i.e. . chromosomal) environment. Thus, the 
sequence may be in a cell-free solution or placed in a 
different cellular environment. The term does not imply 
that the sequence is the only nucleotide sequence 
present, but that it is essentially free (about 90 - 95% 
pure at least) of non-nucleotide material naturally 
associated with it and thus is meant to be distinguished 
from isolated chromosomes. 

One particularly useful method of isolating 
antibodies, such as single chain antibodies from a cell 
extract, is affinity purification. Resins suitable for 
antibody purification are well known in the art, for 
example, protein A sepharose. A recombinant antibody can 
be engineered to contain an affinity purification tag to 
facilitate its purification. Resins suitable for 

antibody purification are well known in the art, for 
example, protein A sepharose. 

Affinity purification tags are generally peptide 
sequences that can interact with a binding partner 
immobilized on a solid support. Synthetic DNA sequences 
encoding multiple consecutive single amino acids, such as 
histidine, when fused to the expressed protein, may be 
used for one-step purification of the recombinant protein 
by high affinity binding to a resin column, such as 
nickel sepharose. An endopeptidase recognition sequence 
can be engineered between the polyamino acid tag and the 
protein of interest to allow subsequent removal of the 
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leader peptide by digestion with enterokinase , and other 
proteases. Sequences encoding peptides such as the 
chitin binding domain (which binds to chitin) , biotin 
(which binds to avidin and strepavidin) , and the like can 
5 also be used for facilitating purification of the protein 
of interest. The affinity purification tag can be 
separated from the protein of interest by methods well 
known in the art, including the use of inteins (protein 
self -splicing elements, Chong, et al. Gene 192:271-281, 
10 1997) . 

By using an amount of resin with binding sites 
PI sufficient for only a small portion of the antibody 

yi present in the unpurified mixture, the process of 

isolation can be used to simultaneously normalize yield 
^ 15 and isolate the antibody. For example, although each 

p sample will contain a different and unknown amount of 

antibody protein, the samples can be contacted with an 
=^7 amount of resin whose maximum binding capacity is 10 mgs . 

y1 Thus any antibody greater than this amount will pass 

20 through the resin unbound. The maximum bound amount can 

then be eluted from the resin. 

Methods for creating microarrays are known in the 
art including printing on a solid surface using pins 
(passive pins, quill pins, and the like) or spotting with 
25 individual drops of solution. Passive pins draw up 
enough sample to dispense a single spot. Quill pins draw 
up enough liquid to dispense multiple spots. Bubble 
printers use a loop to capture a small volume which is 
dispensed by pushing a rod through the loop. 
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Microdispensing uses a syringe mechanism to deliver 
multiple spots of a fixed volume. In addition, solid 
supports, can be arrayed using pizoelectric (ink jet) 
technology, which actively transfers samples to a solid 
5 support . 

One method is described in Shalon and Brown (WO 
95/35505, published 12/28/95) which is incorporated 
herein by reference in its entirety. The method and 
apparatus described in Shalon and Brown can create an 
10 array of up to six hundred spots per square centimeter on 
a glass slide using a volume of 0.01 to 100 nl per spot. 
Suitable concentrations of antibody range from about 1 
ng/|iC to about 1 |J.g/|iC. In the present invention, each 
spot can contain one or more than one distinct antibody. 

15 Other methods of creating arrays are known in the 

art, including photolithographic printing (Pease, et al, 
PNAS 91 (11) : 5022-5026 , 1994) and in situ synthesis. 
While known in situ synthesis methods are less useful for 
synthesizing polypeptides long enough to be antibodies, 

20 they can be used to make polypeptides up to 50 amino 
acids in length, which can serve as binding proteins as 
described below. 

The microarrays can be created on a variety of solid 
surfaces such as plastics (eg. polycarbonate) , complex 
25 carbohydrates (eg. agarose and sepharose) , acrylic resins 
(eg. polyacrylamide and latex beads), and nitrocellulose. 
Preferred solid support materials include glass slides, 
silicon wafers, and positively charged nylon. Specific 



16 



examples of suitable solid supports are described in the 
Examples below. 

Methods for covalent attachment of antibodies to a 
solid support are known in the art. Examples of such 
methods are found in Bhatia, et al. Anal. Biochem. 
178 (2) :408-413, 1989; Ahluwalia, et al, Biosens . 
Bioelectron. 7 (3) : 207-214 , 1992; Jonsson, et al, Biochem. 
J. 227 (2) :373-378, 1985; and Frei j -Larsson, et al, 
Biomaterials 17 (22 ): 2199-2207 , 1996, all of which are 
incorporated by reference herein in their entirety. 
Proteins may additionally be attached to a solid support 
using methods described in the examples below. 

Methods of reducing non-specific binding to a solid 
surface are well known in the. art and include washing the 
arrayed solid surface with bovine serum albumin (BSA) , 
reconstituted non-fat milk, salmon sperm DNA, porcine 
heparin, and the like (see Ausubel, et al. Short 
Protocols in Molecular Biology, 3rd ed. 1995) . 

The arrays used to identify antigen- specific 
antibodies are contacted with a solution containing one 
or more known antigens in order to identify antibodies in 
the array with binding specificity for the antigen. The 
antigens are often proteins, although they may also be 
organic chemical compounds, carbohydrates, nucleic acids, 
and the like. They may be isolated or semi -isolated, 
recombinant or naturally occurring. The amount of 

antigen used can vary from about 1 - 10 0 ng/jxl. The 
antigen is left in contact with the array for an amount 
of time sufficient for antibody : antigen complexes to 
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form, should one of the antibodies in the array be 
specific for the antigen. The amount of time sufficient 
for this purpose will range from 5 minutes to 24 hours, 
and will generally be from 0.5 to 2 hours. 

5 One antigen of particular interest in the practice 

of the invention is recombinant protein, either a full- 
length gene product or a fragment thereof, for example an 
Expressed Sequence Tag (or EST fragment) . EST fragments 
are relatively short cDNA sequences that have been 

10 randomly generated and sequenced, generally as part of an 
ongoing effort to map an entire genome (Adams, et al. 
Science 252 (5013) : 1651-1656 , 1991). Large numbers of 
these sequences are available in public databases. The 
identity of the proteins encoded by the vast majority of 

15 these sequences is unknown. The following discussion, 
although directed to the expression of EST-encoded 
peptides, is equally applicable to any expressed product 
of a nucleic acid sequence, including full-length 
proteins . 

20 Techniques are available in the art by which cells 

can be genetically engineered to express the peptide 
encoded by a given EST fragment . The methods of the 
invention can then be used to identify antibodies 
specific for the peptide. These antibodies are then 

25 useful as reagents that can be employed in purification 
and identification of the full-length protein, and in 
other experimental procedures designed to elucidate the 
protein's location and function. 
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Prokaryotic hosts are, generally, very efficient and 
convenient for the production of recombinant proteins and 
are, therefore, one type of preferred expression system 
for EST fragments. Prokaryotes most frequently are 
5 represented by various strains of E. coll. However, 
other microbial strains may also be used, including other 
bacterial strains , 

In prokaryotic systems, plasmid vectors that contain 
replication sites and control sequences derived from a 

10 species compatible with the host may be used. Examples 
of suitable plasmid vectors may include pBR322, pUCllS, 
pUC119, and the like; suitable phage or bacteriophage 
vectors may include A,gtlO, Xgtll, and the like; and 
suitable virus vectors may include pMAM-neo, PKRC and the 

15 like. Preferably, the selected vector of the present 
invention has the capacity to replicate in the selected 
host cell . 

Recognized prokaryotic hosts include bacteria such 
as E. coll and those from genera such as Bacillus, 
20 Streptomyces, Pseudomonas, Salmonella, Serratia, and the 
like. However, under such conditions, the polypeptide 
will not be glycosylated. The prokaryotic host selected 
for use herein must be compatible with the replicon and 
control sequences in the expression plasmid. 

25 To express an EST fragment in a prokaryotic cell, it 

is necessary to operably link the gene sequence to a 
functional prokaryotic promoter such as the T7 promoter 
or RSC promoter. Such promoters may be either 

constitutive or, more preferably, regulatable (i.e.. 
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inducible or derepressible) . 



Examples of constitutive 



5 



10 



15 



20 



25 



promoters include the int promoter of bacteriophage X, 
the bla promoter of the p- lactamase gene sequence of 
pBR322, the CAT promoter of the chloramphenicol acetyl 
transferase gene sequence of pPR325, and the like. 
Examples of inducible prokaryotic promoters include the 
major right and left promoters of bacteriophage (Pl and 
Pj^) , the trp, reca, lacZ, Lad, and gal promoters of E. 
coli, the a-amylase (Ulmanen Ett at., J. Bacterid. 
162:176-182, 1985) and the sigma-28-specif ic promoters of 
B, subtilis (Oilman et al . , Gene sequence 32:11- 
20(1984)), the promoters of the bacteriophages of 
Bacillus (Gryczan, In: The Molecular Biology of the 
Bacilli/ Academic Press, Inc., NY (1982)), Streptomyces 
promoters (Ward et at., Mol . Gen. Genet. 203:468-478, 
1986), and the like. Exemplary prokaryotic promoters are 
reviewed by Glick (J. Ind. Microtiot . 1:277-282, 1987); 
Cenatiempo (Biochimie 68:505-516, 1986); and Gottesman 
(Ann. Rev. Genet. 18:415-442, 1984). 

Proper expression in a prokaryotic cell also 
requires the presence of a ribosome binding site upstream 
of the gene sequence -encoding sequence. Such ribosome 
binding sites are disclosed, for example, by Gold et at. 
(Ann. Rev. Microbiol, 35:365-404, 1981). The selection 
of control sequences, expression vectors, transformation 
methods, and the like, are dependent on the type of host 
cell used to express the gene. 

Host cells which may be used in the expression 
systems of the present invention are not strictly 
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limited, provided that they are suitable for use in the 
expression of the peptide of interest. Suitable hosts 
may often include eukaryotic cells. Preferred eukaryotic 
hosts include, for example, yeast, fungi, insect cells, 
5 and mammalian cells either in vivo, or in tissue culture. 
Mammalian cells which may be useful as hosts include HeLa 
cells, cells of fibroblast origin such as VERO, 3T3 or 
CHOKl, HEK 293 cells or cells of lymphoid origin (such as 
32D cells) and their derivatives. Preferred mammalian 
10 host cells include SP2/0 and JS58L, as well as 
neuroblastoma cell lines such as IMR 332 and PC12 which 
may provide better capacities for correct post- 
translational processing . 

In addition, plant cells are also available as 
15 hosts, and control sequences compatible with plant cells 
are available, such as the cauliflower mosaic virus 3 5S 
and 19S, nopaline synthase promoter and polyadenylation 
signal sequences, and the like. Another preferred host 
is an insect cell, for example the Drosophila larvae. 
20 Using insect cells as hosts, the Drosophila alcohol 
dehydrogenase promoter can be used. Rubin, Science 
240:1453-1459, 1988). Alternatively, baculovirus vectors 
can be engineered to express large amounts of peptide 
encoded by an EST fragment in insects cells (Jasny, 
25 Science 238:1653, 1987); Miller et al . , In: Genetic 
Engineering (1986), Setlow, J.K., et al . , eds . , Plenum, 
Vol. 8, pp. 277-297) . 

Any of a series of yeast gene sequence expression 
systems can be utilized which incorporate promoter and 
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termination elements from the actively expressed gene 
sequences coding for glycolytic enzymes which are 
produced in large quantities when yeast are grown in 
media rich in glucose. Known glycolytic gene sequences 
5 can also provide very efficient transcriptional control 
signals. Yeast provides substantial advantages in that 
it can also carry out posttranslational peptide 
modifications. A number of recombinant DNA strategies 
exist which utilize strong promoter sequences and high 
10 copy number of plasmids which can be utilized for 
production of the desired proteins in yeast. Yeast 

yS recognizes leader sequences on cloned mammalian gene 

^ sequence products and secretes peptides bearing leader 

sequences (i.e., pre-peptides) . For a mammalian host, 

f;^ 15 several possible vector systems are available for the 

= expression of and EST fragment. 

A wide variety of transcriptional and translational 
4^ regulatory sequences may be employed, depending upon the 

nature of the host. The transcriptional and 

20 translational regulatory signals may be derived from 
viral sources, such as adenovirus, bovine papilloma 
virus, cytomegalovirus, simian virus, or the like, where 
the regulatory signals are associated with a particular 
gene sequence which has a high level of expression. 

25 Alternatively, promoters from mammalian expression 
products, such as actin, collagen, myosin, and the like, 
may be employed. Transcriptional initiation regulatory 
signals may be selected which allow for repression or 
activation, so that expression of the gene sequences can 

30 be modulated. Of interest are regulatory signals which 
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are temperature -sensitive so that by varying the 
temperature, expression can be repressed or initiated, or 
are subject to chemical (such as metabolite) regulation. 

Expression of an EST fragment in eukaryotic hosts 
5 involves the use of eukaryotic regulatory regions. Such 
regions will, in general, include a promoter region 
sufficient to direct the initiation of RNA synthesis. 
Preferred eukaryotic promoters include, for example, the 
promoter of the mouse metallothionein I gene sequence 
10 (Hamer et al . , J. Mol . Appl . Gen. 1:273-288, 1982); the 
=:f TK promoter of Herpes virus (McKnight, Cell 31:355-365, 

n| 1982); the SV40 early promoter (Benoist et al . , Nature 

[j^ (London) 290:304-310, 1981); the yeast gal4 gene sequence 

promoter (Johnston et al . , Proc. Natl. Acad. Sci. 
yi 15 (USA) 79:6971-6975, 1982); Silver et al . , Proc. Natl, 

h Acad. Sci. (USA) 81:5951-5955, 1984), the CMV promoter, 

the EF-1 promoter, and the like. 

An EST fragment and an operably linked promoter may 
be introduced into a recipient prokaryotic or eukaryotic 

20 cell either as a nonreplicating DNA (or RNA) molecule, 
which may either be a linear molecule or, more 
preferably, a closed covalent circular molecule (a 
plasmid) . Since such molecules are incapable of 

autonomous replication, the expression of the gene may 

25 occur through the transient expression of the introduced 
sequence. Alternatively, permanent or stable expression 
may occur through the integration of the introduced DNA 
sequence into the host chromosome . 
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A vector may be employed which is capable of 
integrating the desired gene sequences into the host cell 
chromosome. Cells which have stably integrated the 
introduced DNA into their chromosomes can be selected by 
5 also introducing one or more markers which allow for 
selection of host cells which contain the expression 
vector. The marker may provide for prototrophy to an 
auxotrophic host, biocide resistance, e.g., antibiotics/ 
or heavy metals, such as copper , or the like. The 

10 selectable marker gene sequence can either be directly 
linked to the DNA gene sequences to be expressed, or 
introduced into the same cell by cotransf ection . Common 
selectable marker gene sequences include those for 
resistance to antibiotics such as ampicillin, 

15 tetracycline, kanamycin, bleomycin, streptomycin, 
hygromycin, neomycin, Zeocin™, and the like. Selectable 
auxotrophic gene sequences include, for example, hisD, 
which allows growth in histidine free media in the 
presence of histidinol. 

20 Additional elements may also be needed for optimal 

synthesis of single chain binding protein mRNA. These 
elements may include splice signals, as well as 
transcription promoters, enhancers, and termination 
signals. cDNA expression vectors incorporating such 

25 elements include those described by Okayama, Mol . Cell. 
Bio. 3:280, 1983. 

The recombinant antigen may be produced as a fusion 
protein. When two protein-coding sequences not normally 
associated with each other in nature are in the same 
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reading frame the resulting expressed protein is called a 
"fusion protein" as two distinct proteins have been 
"fused" together. Fusion proteins have a wide variety of 
uses. For example, two functional enzymes can be fused 
5 to produce a single protein with multiple enzymatic 
activities or short peptide sequences, such as epitope 
tags or affinity purification tags (see above) , can be 
fused to a larger protein and serve as aids in 
purification or as means of identifying the expressed 
10 protein by serving as epitopes detectable by specific 
antibodies , 

n| Epitope tags are short peptide sequences that are 

recognized by epitope-specif ic antibodies. A fusion 
protein comprising a recombinant protein and an epitope 
Ul 15 tag can be simply and easily purified using an antibody 

r% bound to a chromatography resin. The presence of the 

lit epitope tag furthermore allows the recombinant protein to 

4^ be detected in subsequent assays, such as Western blots, 

yi without having to produce an antibody specific for the 

20 recombinant protein itself. Examples of commonly used 
epitope tags include V5, glutathione-S- transferase (GST) , 
hemaglutinin (HA) , the peptide Phe-His-His-Thr-Thr , 
chitin binding domain, and the like. 

A fusion protein may be a means by which the 
25 recombinant antigen protein can be easily detected. For 
example, the fusion component can itself be a detectable 
moiety, such as fluorescent protein (fluorescent green 
protein, fluorescent yellow protein, and the like) , or 
alternatively can be one member of a specific binding 



pair (such as biotin and streptavidin, for example) which 
can be detected by reacting with the other member 
conjugated to a detectable substance . 

The foregoing elements can be combined to produce 
vectors suitable for use in the methods of the invention. 
Those of skill in the art would be able to select and 
combine the elements suitable for use in their particular 
system. 

The introduced nucleic acid molecule can be 
incorporated into a plasmid or viral vector capable of 
autonomous replication in the recipient host. Any of a 
wide variety of vectors may be employed for this purpose. 
Factors of importance in selecting a particular plasmid 
or viral vector include: the ease with which recipient 
cells that contain the vector may be recognized and 
selected from those recipient cells which do not contain 
the vector; the number of copies of the vector which are 
desired in a particular host; and whether it is desirable 
to be able to "shuttle" the vector between host cells of 
different species . 

Suitable prokaryotic vectors include plasmids such 
as those capable of replication in E. coil (for example, 
pBR322, ColEl, pSClOl, PACYC 184, itVX, pRSET, pBAD 

(Invitrogen, Carlsbad, CA) , and the like) . Such plasmids 
are disclosed by Sambrook (cf. ''Molecular Cloning: A 
Laboratory Manual", second edition, edited by Sambrook, 
Fritsch, & Maniatis, Cold Spring Harbor Laboratory, 

(1989)). Bacillus plasmids include pCl94, pC221, pTl27, 
and the like, and are disclosed by Gryczan (In: The 



26 



Molecular Biology of the Bacilli, Academic Press, NY 
(1982) , pp. 307-329) . Suitable Streptomyces plasmids 
include plJlOl (Kendall et al . , J. Bacterid. 169:4177- 
4183,1987), and streptomyces bacteriophages such as (t)C31 
5 (Chater et al.. In: Sixth International Symposium on 
Act inomycetales Biology, Akademiai Kaido, Budapest, 
Hungary (1986), pp. 45-54). Pseudomonas plasmids are 
reviewed by John et al . (Rev, Infect, Dis. 8:693-704, 
1986), and Izaki (Jpn. J. Bacterid. 33:729-742, 1978). 

10 Suitable eukaryotic plasmids include, for example, 

— 1=: 

BPV, vaccinia , SV4 0 , 2 -micron circle , pCDN3 . 1 
ni (Invitrogen) , and the like, or their derivatives. Such 

yi plasmids are well known in the art (Botstein et al . , 

Miami Wntr. Symp . 19:265-274, 1982); Broach, In: "The 
^ 15 Molecular Biology of the Yeast Saccharomyces : Life Cycle 

P and Inheritance", Cold Spring Harbor Laboratory, Cold 

£1 Spring Harbor, NY, p. 445-470 (1981) ; Broach, Cell 

X. 28:203-204, 1982); B (Dilon et at., J, Clin. Hematol . 

y| Oncol. 10:39-48, 1980); Maniatis, In: Cell Biology: A 

20 Comprehensive Treatise, Vol. 3, Gene Sequence Expression, 

Academic Press, NY, pp. 563-608 (1980) . 

Once antibody : antigen complexes have been formed and 
unbound antigen washed away under suitable conditions, 
the antibody : antigen complexes can be detected using one 
25 of several techniques known in the art . Suitable washing 
conditions are known to those skilled in the art (see, 
for example, Ausubel, et al. Short Protocols in Molecular 
Biology, 3rd ed. 1995) . Exemplary washing conditions are 
shown in the examples below. 
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For detection in the case of recombinant antigens, 
expression vectors can be used that form chimeric fusion 
peptides as described above. The epitope tagged antigen 
can be detected using an antibody specific for the tag 
5 sequence. This antibody may be itself detectably labeled 
or can be detected with a third detectably- labeled 
antibody. Alternatively, the antigen can be complexed 
with biotin and detected using detectably- labeled avidin 
or streptavidin . The antigen itself can also be 

10 detectably labeled, such as with a fluorescent dye 
^„ compound . 

■-in 

pjl The term "detectably labeled" as used herein is 

intended to encompass antigen directly coupled to a 
rf detectable substance, such as a fluorescent dye, and 

Ul 15 antigen coupled to a member of binding pair, such as 

Q biotin/streptavidin, or an epitope tag that can 

l;^ specifically interact with a molecule that can be 

4^ detected, such as by producing a colored substrate or 

fluorescence. 

20 Substances suitable for detectably labeling proteins 

include fluorescent dyes such as fluorescein 
isothiocyanate (FITC) , fluorescein, rhodamine, 

tetramethyl-rhodamine-5- (and 6) -isothiocyanate (TRITC) , 
Texas red, cyanine dyes (Cy3 and Cy5, for example) , and 

25 the like; and enzymes that react with colorometric 
substrates such as horseradish peroxidase. The use of 
fluorescent dyes is generally preferred in the practice 
of the invention as they can be detected at very low 
amounts. Furthermore, in the case where multiple 
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antigens are reacted with a single array, each antigen 
can be labeled with a distinct fluorescent compound for 
simultaneous detection. Labeled spots on the array are 
detected using a fluorimeter, the presence of a signal 
5 indicating an antigen bound to a specific antibody. 

The formation of antibody : antigen complexes can be 
performed under a variety of conditions to identify 
antibodies with varying binding characteristics. 
Antigen-containing reaction solutions can contain varying 
10 degrees of salt or be conducted at varying pH levels. In 
addition, the binding reaction can be carried out at 
vt varying temperatures. Each set of conditions will 

yi identify antibodies with different affinity for the 

antigen. For example, antibodies that bind at pH 2 may 
15 have utility under highly acidic conditions such as those 
r| that exist in the stomach. Similarly, antibodies that 

bind at temperatures near boiling may be useful in 
studying thermophilic organisms. In general pH 

d| conditions will range from 2-10 (most preferably around 

20 pH 8) , temperatures from 0° C - 100°C and salt conditions 
from 1 |UM to 5 M (in the case of NaCl) . 

Affinity constants are a measure of the interaction 
between a particular ligand and its cognate receptor. 
The "binding affinity" or the measure of the strength of 
25 association between a particular antibody : antigen 
interaction is generally measured by affinity constants 
for the equilibrium concentrations of associated and 
dissociated configurations of the antibody and its 
antigen. Preferably the binding of the antigen should 



29 



occur at an affinity of about ka= lO'^M or greater to be 
useful for the present invention, with greater than about 
lO'^M being more preferable, and most preferably between 
about 10'®M and about lO'^^M. Antibody fragments will 
5 generally have affinities in the range of about 10~^M to 
10''^M. 

In another embodiment of the invention, microarrays 
of uncharacterized antibodies are used to compare the 
protein expression profiles of cells. For example, 
10 comparisons can be made between a population of cells 
y from one tissue, such as arterial endothelial cells, and 

ni a second tissue, such as venous endothelial cells or from 

cells derived from a particular tissue but from different 
¥f species. Comparisons can be made between normal cells 

Ul 15 and cells from the same tissue type that originate from 

p an individual with a pathogenic disorder. For example, 

comparisons can be made between normal cells and cancer 
4^ cells. Comparisons can additionally be made between 

cells in a resting state and cells in an activated state, 
20 for example, resting T-cells and activated T-cells. 

In another example, the disclosed arrays are useful 
for evaluating the expression of proteins by pathogens, 
such as, for example, bacteria, parasites, viruses, and 
the like. A solution (such as a lysate) made from the 
25 pathogen which represents all proteins expressed by the 
pathogen can be used to contact an antibody array to 
identify antibodies recognizing pathogen-expressed 
proteins. These antibodies have utility as diagnostic 
agents as well as potential therapeutics. 
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Cellular lysates can be used as "antigens" as 
described above and reacted with two identical 
microarrays . Antibodies reactive in one array but not 
the other would indicate the presence of a differentially 
5 expressed protein. This antibody is then useful for the 
subsequent isolation and identification of those proteins 
that are different in two populations of cells. In the 
case of normal and cancer cells, for example, one may be 
able to identify proteins expressed in the cancer cell 
10 that contribute to its malignant state. 

In a further aspect of the invention, microarrays 
PI can be composed of previously characterized antibodies. 

=S IS 

iJI These microarrays have a variety of uses, one of which is 

'f^ cell profiling. For example, an array can be composed of 
^1 15 antibodies that recognize a set of antigens known to be 

p present in activated T-cells but not in resting T-cells. 

J=S A population of T-cells can then be lysed and the lysate 

==5^ contacted with the array to determine if the population 

4^ has the profile of activated or resting T-cells. 

20 Microarrays and the methods disclosed herein can be 

used in methods of diagnosing particular disorders. For 
example, a collection of antibodies specific for a range 
of antigens associated with one or more disorders can be 
arrayed and contacted with a bodily fluid containing 

25 antigens whose presence, or absence, would indicate a 
particular disorder. The advantage of using a microarray 
over a conventional immunoassay is the ability to include 
a population of antibodies diagnostic for a variety of 
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disorders on a single surface, significantly reducing 
time, costs and materials needed to effect a diagnosis. 

For example, if a patient presents with symptoms 
that are characteristic of several distinct disorders 
5 which can be distinguished on the basis of the presence 
or absence of one or more proteins, a single microarray 
assay could be used to make a specific diagnosis, thus 
allowing the patient to be properly treated. Patients 
suffering from stroke or brain infarcts release several 
10 proteins into cerebrospinal fluid, examples of which are 
'^1 neuron specific enolyse (NSE) from neuronal cells and 

Hi S-lOO from glial cells and astrocytes. Such proteins are 

yi not released in conditions that may have similar 

symptoms, such as drug reactions, making proper diagnosis 
15 more difficult. A diagnostic array could readily detect 

5 

Q these and other proteins in the CSF, leading to a rapid 

f5 clinical diagnosis and treatment. 

^ In another aspect of the invention microarrays are 

employed to characterize protein expression patterns 

20 using nucleic acid samples. Briefly, nucleic acid 

molecules from a whole cell or tissue are applied to a 
solid support using a microarray format. The arrayed 
nucleic acid samples are then contacted with a nucleic 
acid probe specific for a gene encoding a known protein. 

25 The probe solution is left in contact with the array for 
an amount of time sufficient to allow sample iprobe 
complexes to form, then the unbound probe is washed away 
under suitable conditions (see, for example, Ausubel, et 
al. Short Protocols in Molecular Biology, 3rd ed. 1995 




32 



and the examples below) . Bound probe is detected at one 
or more nucleic acid sample spots using one of a variety 
of detection methods. 

This aspect of the invention has a variety of uses. 
For example, the microarray can be constructed from 
nucleic acid samples isolated from a single tissue type 
but from a large number of species, with each spot 
representing a particular species. Thus in a single 
assay format one can determine the evolutionary 
development of the protein represented by the probe. 
Similarly, the microarray can be constructed of multiple 
tissue types from a single species, or from different 
developmental stages of a single species (or multiple 
species) thus simply and efficiently determining tissue 
expression of the protein represented by the probe. For 
example, a microarray can be constructed with arrayed 
samples representing all the developmental stages of 
Drosophila, a well known organism the study of which has 
led to a greater understanding of mammalian physiology 
and development , 

The nucleic acid sample can be messenger ribonucleic 
acid (mRNA) or can be complementary dexoyribonucleic acid 
(cDNA) , including EST fragments. Methods for extracting 
and isolating nucleic acids from cells are well known in 
25 the art (for example phenol extract ion/ethanol 
precipitation, ammonium acetate precipitation, cesium 
chloride gradients, and the like) , as are methods for 
generating cDNA (see, for example, "Molecular Cloning: A 
Laboratory Manual," second edition, edited by Sambrook, 
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Fritsch, Sc Maniatis, Cold Spring Harbor Laboratory, 1989; 
and Ausubel/ et al. Short Protocols in Molecular Biology, 
3rd ed. 1995, both of which are incorporated by reference 
herein) . Microarrays of these nucleic acids are created 
5 using the methods described above. Techniques for 

coupling nucleic acids to solid supports used to 
construct microarrays are well known in the art, 
including the poly-L-lysine and phenylboronic acid 
methods described in the Examples below. 

The nucleic acid probes used in the invention 
methods can be designed based on the sequence of a gene 
encoding a known protein or can be an EST fragment, as 
described above. One skilled in the art can readily 
design such probes based on the known sequence using 
methods of computer alignment and sequence analysis known 
in the art (e.g. '^Molecular Cloning: A Laboratory 

Manual", second edition, edited by Sambrook, Fritsch, & 
Maniatis, Cold Spring Harbor Laboratory, 198 9; Ausubel, 
et al. Short Protocols in Molecular Biology, 3rd ed. 
1995) . The probe can comprise any number of nucleotides 
but will preferably be not fewer than 10 nucleotides and 
preferably not more than about 300 nucleotides in length. 

The probes of the invention can be labeled by 
standard labeling techniques such as with a radiolabel, 
25 enzyme label, fluorescent label, biotin-avidin label, 
chemiluminescent label, and the like. After 
hybridization, the probes may be detected using known 
methods. Preferred labels are fluorescent labels, as 
described above . 
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The nucleic acid probes of the present invention 
include RNA as well as DNA probes and nucleic acids 
modified in the sugar, phosphate or even the base portion 
as long as the probe still retains the ability to 
5 specifically hybridize under conditions as disclosed 
herein. Such probes are generated using techniques known 
in the art . 

The term "hybridize" as used herein refers to a 
method of interacting a nucleic acid sequence with a DNA 
10 or RNA molecule in solution or on a solid support, such 
^1 as cellulose or nitrocellulose. If a nucleic acid 

^"^J sequence binds to the DNA or RNA molecule with 

Ul sufficiently high affinity, it is said to ''hybridize" to 

the DNA or RNA molecule. The strength of the interaction 
15 between the probing sequence and its target can be 
0 assessed by varying the stringency of the hybridization 

conditions. Various low to high stringency hybridizatxon 
^ conditions may be used depending upon the specificity and 

41 selectivity desired. Stringency is controlled by varying 

20 salt or denaturant concentrations. Examples of 

hybridization conditions are shown in the Examples below. 
Those skilled in the art readily recognize how such 
conditions can be varied to vary specificity and 
selectivity. For example, under highly stringent 

25 hybridization conditions only highly complementary 
nucleic acid sequences hybridize. Preferably, such 

conditions prevent hybridization of nucleic acids having 
even one or two mismatches out of 20 contiguous 
nucleotides . 



In a further aspect of the invention, microarrays 
can be composed of randomly generated polynucleotides 
(DNA or RNA) and contacted with proteins to identify 
unique binding pairs. Polynucleotides are now known to 
bind to proteins and may have potential as diagnostics 
and therapeutics (see, for example, Allen, et al. 
Virology 209 (2 ): 327-336 , 1995; Binkley, et al. Nucleic 
Acids Res. 23 (16) : 3198-3205, 1995). Polynucleotides can 
be evaluated in very large numbers using the methods 
disclosed herein thus increasing the likelihood of 
identifying a useful binder. 

The invention will now be described in greater 
detail by reference to the following non- limiting 
examples . 

Example I - Nucleic Acid Microarrays 

The following procedures are conducted at room 
temperature and using double distilled water unless 
otherwise noted. These methods are applicable to arrays 
of polypeptides or polynucleic acids. 

Glass slides are prepared as follows: NaOH (50 g) 
is dissolved in 150 ml of double distilled water (ddHsO) , 
then 200 ml of 95% EtOH is added while stirring. If the 
solution becomes cloudy, ddHsO is added until it becomes 
clear. Approximately 30 glass slides (Gold Seal, Cat. 
No. 3010) are soaked in the NaOH/EtOH solution for 2 
hours, shaking. The slides are then rinsed three times 
with ddHsO. The slides are next soaked in a poly-L- lysine 
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solution (70 ml poly-L- lysine (Sigma Cat. No. 8920) to 
2 80 ddH20) for 1 hour. Excess liquid is removed by 
spinning the slides in a rack on a microtiter plate 
carrier at 500 rpm. The slides are dried at 40° C for 5 
5 minutes, then stored in a closed box for at least 2 weeks 
prior to use . 

A cDNA microarray is prepared as follows: Total 
mRNA is isolated from tissue (for example, nerve cells) 
of a variety of species representative of different 

10 classes of organisms such as Drosophila, nematode, 
salmon, clam, chicken, mouse, dog, goat, spider monkey, 
chimpanzee, human, and the like, by the FastTrac method 
(Stratagene, La Jolla, CA) or other common methods. mRNA 
is also obtained from a variety of unicellular organisms 

15 such as E. coll, yeast, B, suhtilis , mycoplasma and the 
like. Eurokaryotic mRNA is enriched from total RNA using 
oligo(dT) cellulose (Ausubel, et al. Short Protocols in 
Molecular Biology, 3rd ed. 1995, pgs 4-11 - 4-12) . 
Equivalent amounts (for example, 1 |xg) of mRNA from each 

20 source are placed in a separate well of one or more 96 
well microtiter plates and precipitated with cold EtOH. 
The precipitate is rinsed with 70% EtOH and allowed to 
dry. 

The dried mRNA is resuspended in 3x SSC (sodium 
25 chloride/sodium citrate - 20X solution is 3 M NaCl 
(175g/L0 0.3 M trisodium citrate 2H2O (88g/L adjusted to 
pH 7.0 with 1 M HCl) then spotted onto a previously 
prepared glass slide using an array device (for example, 
Shalon and Brown (WO 95/35505, published 12/28/95)). The 
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prepared array can be kept for a long period of time 
before probing, however, if the slides are to be kept for 
long periods of time, stability is increased by 
converting each mRNA sample into cDNA using techniques 
5 known in the art, such as PGR. 

The array is rehydrated by suspending the slide over 
a dish of ddHjO (50*^C) for approximately one minute. The 
slide is quickly (approximately 3 seconds) dried by 
placing it on a surface heated to 10 0° C (mRNA side up) . 
10 The mRNA is crosslinked to the poly-L- lysine coating of 
the slide using ultraviolet radiation using a 
Stratalinker™ UV device according to the manufacturer's 
instructions (Stratagene) set at 60 milliJoules. 

The slides are next soaked in a solution of 5 grams 
of succinic anhydride (Aldrich Cat. No. 23,969-0) 
dissolved in 315 ml of N-methyl-pyrrilidinone (Aldrich 
Cat. No. 32,863-4) plus 35 mis of 0.2 M sodium borate 
(brought to pH 8,0 with NaOH) for 15 minutes with 
shaking. The slide is then transferred to a 95*^ C water 
bath for 2 minutes followed by 95% EtOH for 1 minute. 
Excess liquid is removed from the slides by spinning a 
rack of slides on a microtiter plate carrier at 500 rpm. 

A probe sequence of a known protein (for example, 
human nerve growth factor, GeneBank Accession No. E03589) 
25 is labeled using standard protocols, for example by using 
a CyDye Nick Translation kit (Amersham) . The labeled 
probe (approximately 1 jig/ml) is resuspended in 4X SSC 
(10 |il) to which is added 0.2 |il 10% sodium dodecyl 
sulfate (SDS) . The probe is boiled for 2 minutes, then 
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cooled for 10 seconds and transferred to the array by- 
pipette. The array is covered by a 22mm x 22mm cover 
slip, and the slide is placed in a humid hybridization 
chamber and submerged into a hot water bath 75°C) . 

The slide is left in the bath for 10 - 24 hours, 
then the cover slip is removed and the slide rinsed in 
0 . 2X SSC with 0.1% SDS several times. Excess wash buffer 
is removed by centrif ugation on a microtiter plate 
carrier as described above. The slide is scanned using a 
spectrof luorometer , such as the ScanArray 3000 (General 
Scanning Inc., Watertown, MA), For probes labeled with 
CyS , for example, fluorescence is measured at 670 nm. 

Localization of spots on the array to which the 
probe hybridizes indicates that the species represented 
by the spot expresses a protein similar or identical to 
the probe protein. 

The procedure outlined below is an alternative 
method for binding arrayed molecules to a solid support, 
using an SA (OCH2CN) -X-NHS linkage (see, for example, US 
5,594,111, issued 1/14/97; 5,648,470, issued 5/15/97; US 
5,623,055, issued 4/22/97; all of which are incorporated 
by reference herein) . 

Glass slides (Fisher Catalog No. 12-544-4) are 
soaked in an acid bath (1 hour in 0.1 M HCl) , then washed 
with water and dried at room temperature. The slides 
should not be aggressively dried, such as in an oven. 
The slides are next soaked in a silane solution overnight 
at room temperature (5% APTES ( 3 -aminopropyl- 
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triethoxysilane, Aldrich 28,177-8), 0.3% DIEA (Sigma) v/v 
in EtOH) . The slides may be sonicated for 10 - 15 
minutes right after being placed in the APTES solution. 

The slides are rinsed with isopropyl alcohol, then 
5 sonicated in isopropyl alcohol for several minutes. 
Sonication should remove any white silane residue on the 
slides. If the residue remains, the slides should be 
discarded. After sonication, the slides are left to 
cure/dry for at least 24 hours before use. 

10 The cured slides are next soaked in a linker 

solution overnight at room temperature. The linker 
solution is made by dissolving 115 mg of 9Y SA (OCH2CN) -X- 
COOH (Prolix, Bothell, WA) in 1 ml dimethyl formamide 
(DMF) plus 60 fil DIEA, then adding 60 mg TSTU (Sigma) and 

15 leaving for 15 minutes at room temperature. This stock 
is diluted in 270 ml of isopropyl alcohol plus 270 ja,l 
DIEA before using. 

The slides are removed from the linker solution and 
soaked in 1 M NH2OH, ImM EDTA, 0.1 M NaHCOs (pH 10) for 4 
20 hours at room temperature. This solution is removed, the 
slides are extensively washed with water then let air dry 
at room temperature. The slides can be stored at room 
temperature away from light before using to make arrays. 

Examp le II z Determin ation of Optimal Concentrations of 

25 Antibody and Antigen 

Various concentrations (1 ^g/^l, 100 ng/|4,l, 10 ng/jal, 
1 ng/^il) of total mouse IgG or a mouse monoclonal anti- 
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PLC-gamma were spotted on aldehyde slides (Cel 
Associates, Inc., Houston, Texas), which allow non- 
covalent attachment of proteins. Using a manual 8 pin 
hand arrayer the slides were blocked for 1 hour with PBST 
5 (phosphate buffered saline and 0.10% Tween 20), and 3% 
milk protein. The slides were subsequently washed three 
times, 15 minutes each, in PBST. Duplicate slides were 
incubated with 50 )il of goat anti-mouse IgG antibody 
(GAMG) conjugated with CY3 or CY5 fluorescent dye 
10 compounds (Amershan, Arlington Heights, Illinois) at 10 
|ug/ml or 1 jag/ml , Slides were then washed for 15 minutes 
in PBST three additional times and dried by 
centrif ugation prior to scanning. Binding was detected 
as shown in Table 1 below. 
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Table 1 



ATI L. 1 JDoay 


Cone . 


i\nu igen 


Cone . 


jje uec u xon 


PLC -gamma 


1 ua/ul 


GAMG-CY3 


10 )ig/ml 


+++ 




10 0 ng/|Lil 




10 ua/ml 


+++ 




10 ng/^1 




10 ^ig/ml 


+ 




1 na/ul 




10 }xg/ml 




mouse IgG 


1 ucr/ul 


GAMG-CY3 


10 i^g/ml 


+++ 




100 ng/)xl 




10 |ig/ml 


+++ 




10 no/ul 




10 |Lig/ml 






1 ncr/iil 




10 jxg/ml 




PLC -gamma 


1 uvct/liI 


GAMG-CY3 


1 )-ig/ml 


+ 




100 ncr/iil 




1 |Lig/ml 


+ 




10 na /ill 




1 ua/ml 






1 na/ul 




1 |Lig/ml 




mouse IgG 


1 Licr / 111 


GAMG-CY3 


1 uo /ml 


+ 




100 no/ul 




1 jxg/ml 






10 no/ul 




1 |ig/ml 






1 no/ul 




1 )ig/ml 




PLC -gamma 


1 ua/ul 


GAMG-CY5 


10 (ig/ml 


+++ 




10 0 no /ul 




10 ua/ml 


H- + + 




1 0 na /ul 




1 0 ua / ml 










1 0 ua /ml 




mouse IgG 


1 uo / U 1 


GAMG-CY5 


10 ua/ml 

^ V r^zp / ^^^^ 


H- + + 




10 0 na / ul 




10 ua/ml 


+ + + 




10 ncr/ul 




10 )ig/ml 


+ 




1 ng/^il 




10 |j,g/ml 




PLC -gamma 


1 Mg/i^i 


GAMG-CY5 


1 jag/ml 






100 ng/^1 




1 |xg/ml 


+ 




10 ng/^il 




1 |xg/ml 






1 ng/^l 




1 Mg/n^l 




mouse IgG 


1 Mg/Ml 


GAMG-CY5 


1 |ig/ml 


+ 




100 ng/^1 




1 |ig/ml 


+ 




10 ng/nl 




1 |ag/ml 






1 ng/^il 




1 |Lig/ml 





+++ strong signal, ++ moderate signal 
+ weak signal, - no signal 
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B?^amplg III - Comparison ot Solid Support s 

Serial dilutions (1 )ig/ml, 100 ng/ml, 10 ng/ml, 1 
ng/ml) of mouse IgG or PLC-gamma were hand arrayed onto 
aldehyde, polystyrene, nitrocellulose and Surmodics 
5 slides. Aldehyde, nitrocellulose, polystyrene and 

Surmodics slides were purchased from various outside 
vendors (aldehyde Slides - Cel Associates, Inc., Houston, 
Texas; nitrocellulose Slides - Molecular Probes, Inc., 
Eugene, Oregon; polystyrene Slides - Nunc, Inc., 
10 Naperville, Illinois; Surmodics Slides - Surmodics, Inc., 
Eden Prairie, Minnesota) . Surmodics slides have an 
undisclosed polymer on the glass surface which forms a 
covalent linkage with proteins under the appropriate 
conditions (described by the manufacturer) . 

15 Following hand arraying of the antibodies 

(approximately 20-30 nanoliters per spot) , the 
nitrocellulose, aldehyde, and polystyrene slides were 
immediately blocked for 1 hour with PBST and 3% milk, 
washed 3 times with PBST, and hybridized with 50 |ul of 

20 GAMG-CY3 for 30 minutes, Surmodics slides were incubated 
overnight in a moist salt chamber as recommended by the 
manufacturer. The following day, the Surmodics slides 
were processed as described above. Following 
hybridization all of the various slides were washed 3 

25 times in PBST, dried and scanned using a Scan Array 3 000 
fluorescent scanner . 

All of the slides tested allowed for the detection 
of antigen : antibody binding at higher concentrations of 
antibody. The Aldehyde and Nitrocellulose treated slides 
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were the most efficient at binding antibody, and 
antibody : antigen interaction could be detected at lng/|xl. 

Example IV r Detection of Binding Using Non- Fluorescent 

5 M et h o ds 

Positively charged nylon filters (Zeta Probe Membranes, 
Bio Rad Laboratories, Hercules, CA) were hand arrayed 
using 1 jul of anti-His, anti-V5, anti- thioredoxin (anti- 
Thio) , anti-FOS, anti -PLC -gamma and anti-CREB antibodies 

10 (Invitrogen, Carlsbad, CA; all antibodies were 
approximately 1 mg/ml) . Filters were blocked for 1 hour 
with PBST and 3% milk, washed three times with PBST, and 
incubated with 1 p,g/ml biotinylated Dl protein for three 
hours at room temperature. Dl is a creatine kinase 

15 fusion protein isolated from a human fetal heart cDNA 
library and cloned into the pBAD-Thio-His-TOPO vector 
(Invitrogen, Carlsbad, CA) to create a Thioredoxin- V5- 
His-creatine kinase fusion protein. Dl was biotinylated 
using the EZ-Link™ Sulfo-NHS-LC Biotinylation Kit 

20 (Pierce, Rockford, IL) used according to the 
manufacturer's instructions) . 

Following three additional washes with the same 
buffer, filters were treated with streptavidin/alkaline 
phosphatase conjugate or streptavidin/horseradish 
25 peroxidase conjugate (Boehringer Mannheim, GmbH Germany) 
for 1 hour at room temperature . 

The filters were washed 5 times with PBST, dried, 
and developed by immersion in ECL chemi luminescent 
substrate (ECL - Amersham, Arlington Heights, Illinois) 
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or the chromogenic substrate BCIP/NBT (Sigma Chemicals, 
St. Louis, MO) . Filters developed with ECL were exposed 
to Kodak chemiluminescent film for 1 to 10 seconds. 

The results are shown in Figures lA, IB, and IC. In 
5 all cases, only the antibodies specific for epitopes on 
the fusion protein antigen were detectable, and only in 
the arrayed spots, showing that the system has both good 
signal to noise ratio and specificity. 

The experiment was repeated using an array created 
10 with an automated arrayer. Antibodies (1 mg/ml) were 
spotted using an automated 96 pin microarrayer developed 
at Invitrogen. Fifteen negative control antibodies 

(assorted mouse monoclonals) were arrayed along with the 
three positive control antibodies (anti-His, anti-Thio, 
15 anti-V5) . Filters were treated as described above using 
the alkaline phosphatase conjugate and the chromogenic 
substrate BCIP/NBT. 

As can been seen in Figure 2, binding and detection 
of antibody : antigen complexes was highly specific and 
20 sensitive. 



Example V - Evaluation of Antibody Affinity 

Anti-His, anti-V5, anti-FOS, ant i- PLC -gamma, 25C1DG, 
and anti-VEGF (vascular endothelial growht factor) 
25 antibodies were arrayed on a nitrocellulose slide and 
reacted with biotinylated Dl protein as previously 
described. Binding was detected with streptavidin - Cy3 
as described above. The anti-V5 antibodies spots showed 
red, the anti-His spots showed green, while the negative 
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controls were undetecable (see Figure 3) . When viewed in 
a black and white drawing, relative increase in binding 
affinity is visualized by an increase of white in a given 
area. The color of the spots generally indicates a 
5 higher amount of f luorescently labeled antigen present, 
and thus indicates relative binding affinity between 
antibody and antigen. Colors, in decending order from 
highest to lowest affinity, are white, red, yellow, 
green, and blue. Using this technique, multiple 

10 antibodies can be tested for their affinity to a single 
-.^ antigen. 

Example VI - Polyclonal Antibody Microarrays 

To demonstrate specific binding to polyclonal 
yl 15 antibodies, six antibodies were arrayed by hand on a 

r| nitrocellulose slide, three polyclonal antibodies (anti- 

E12 (unpurified rabbit polyclonal sera to a His-V5- 
thioredoxin- thymidine kinase fusion protein) , anti-lexA 
(lexA repressor protein), and ant i-GFP (Green fluorescent 
20 protein)) and three monoclonal antibodies (anti-V5, anti- 
His and anti-GalU (a mammalian transcription factor) . 
The slide was blocked with PBST and 3% milk for 1 hour at 
room temperature, and incubated with the E12-biotin 
conjugate, prepared according to the protocol used for Dl 
25 protein. Following extensive washing with PBST, the 
slides were incubated with streptavidin-CY3 conjugate 
(Amersham, Arlinton Heights, IL) for 1 hour at room 
temperature, washed 5 times with PBST and dried by 
centrifugation prior to scanning on the Scan Array 3000. 
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As can be seen in Figure 4, binding was detected 
with both the antigen specific polyclonal antibody (anti- 
E12) and the antigen specific monoclonal antibodies 
(anti-His, anti-V5) and not with any of the negative 
5 control antibodies. 

B;?^ample VII - Mig^Qaryay Analysis ot J^abeled Cell Ji<ysate 

A series of experiments were conducted to determine 
if a microarray of antibodies could specifically detect 
10 antigens in a cell lysate. 

CHO cells expressing high levels of beta- 
galactosidase were grown to confluency in a T-175 flask. 
(Hams media with Pen/Strep, and L-glutamine plus 10% FCS , 
at 3 7°C with 5% CO2) Cells were harvested using 

15 Trypsin/EDTA. NP4 0 extracts were prepared by pelleting 
the cells (10^ cells) , washing once in PBS and 
resuspending in 5% NP40. Cell debris was removed by 
centriguation . Soluble protein was biotinylated using a 
Pierce biotinylation kit according to the manufacturer's 

20 instructions . 

Nitrocellulose slides (see above) containing arrayed 
monoclonal antibodies (anti-beta-gal, anti-His, anti- 
Thio, anti-V5, anti-FOS, ant i -PLC -gamma, anti-VEGF and 
25C10G (an anti-CREB antibody) were blocked, washed, 
25 hybridized and developed with streptavidin-CY3 as 
described in Example VI supra. As can be seen in Figure 
5A, beta-galactosidase binding was seen, however, some 
non-specific binding was detected as well. 
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The experiment was repeated, except that after 
centrif ugation of the extract, soluble protein was 
dialyzed overnight against 50 mM phosphate buffer at 4°C 
prior to biotinylation . As can be seen in Figure 5B, 
5 much of the non-specific binding seen in the previous 
experiment was eliminated. 

In the next experiment dialyzed extract containing 
the biotinylated soluble proteins was adjusted to 10% 
glycerol to reduce non-specific hydrophobic interactions. 
10 Furthermore, the sodium chloride concentration was 
O adjusted to 0.2 M NaCl to increase specific ionic 

ni interactions. All other conditions remained identical. 

?^ As can be seen in Figure 5C, all non-specific binding was 

eliminated using this protocol. 

15 

□ While the foregoing has been presented with 

p| reference to particular embodiments of the invention, it 

% will be appreciated by those skilled in the art that 

changes in these embodiments may be made without 
20 departing from the principles and spirit of the 

invention, the scope of which is defined by the appended 

claims . 
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